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METHODS USING JVlECHANlSM&ljf ACTlOaL OF AKO A 

FIELD OF THE INVENTION 

5 This invention relates to newly identified polynucleotides and polypeptides, and their 

production and uses, as well as their vanants, agonists and antagonists, and their uses. In 
particular, in these and in other regards, the invention relates to novel polynucleotides and 
polypeptides of the arc (5-enolpyruvylshikimate-3-phosphate synthase) fannily. hereinafter 
referred to as "aroA". 



10 



BACKGROUND OF THE INVENTION 



The Streptococci make up a medically important genera of microbes known to cause 
several types of disease in humans, including, for example, otitis media, conjunctivitis, 
15 pneumonia, bacteremia, meningitis, sinusitis, pleural empyema and endocarditis, and most 
particularly meningitis, such as for example infection of cerebrospinal fluid. Since its 
isolation more than 100 years ago. Streptococcus pneumoniae has been one of the more 
intensively studied microbes. For example, much of our early understanding that DNA is, in 
fact, the genetic material was predicated on the work of Gnffith and of Avery, Macleod and 
20 Mccarty using this microbe. Despite the vast amount of research wi,h S. pneimomae, many 
questions concerning the virulence of this microbe remain. It is particularly preferred to 
employ Streptococcal genes and gene products as targets for the development of antibiotics. 

The frequency of Streptococcus pneumoniae infections has risen dramatically in the 
past 20 years. This has been attributed to the emergence of multiply antibiotic resistant strains 
25 and an increasing population of people with weakened immune systems. It is no longer 
uncommon to isolate Streptococcus pneumoniae strains which are resistant to some or all of 
the standard antibiotics. This has created a demand for both new anti-microbial agents and 
diagnostic tests for this organism. 

Clearly, there is a need for factors, such as the novel compounds of the invention, that 
30 have a present benefit of being useful to screen compounds for antibiotic activity. Such 
factors are also useful to determine their role in pathogenesis of infection, dysfunction and 
disea.se. There is also a need for identification and characterization of such factors and their 
antagonists and agoni.sts which can play a role in preventing, ameliorating or correcting 
infections, dysfunctions or diseases. 
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The polypeptides of the invention have amino ac.d sequence homology to a known 
Lactococcus lactts 5-enolpyruvoylshikimatc -3-phosphaie synthase (aro A) protein. 

SUMMARY OF THE INVENTION 

5 

It IS an object of the invention to provide polypeptides that have been identified as 
novel aroA polypeptides by homology between the amino ac.d sequence set out m Table 1 
[SEQ ID NO; 2] and a known amino acid sequence or sequences of other proteins such as 
Lactococcus lactis 5-enolpyruvoylshikimate -3-phosphate synthase (aro A) protein. 
10 It is a further object of the invention to provide polynucleotides that encode aroA 

polypeptides, particularly polynucleotides that encode the polypeptide herein designated aroA. 

In a particularly preierred embodiment of the invention the polynucleotide comprises 
a region encoding aroA polypeptides compnsing the sequence set out in Table 1 [SEQ ID 
NO;l] which includes a full length gene, or a variant thereof. 
15 In another particularly preferred embodiment of the invention there is a novel aroA 

protein from Streptococcus pneumoniae comprising the amino acid sequence of Table 1 
[SEQ ID N0:2]. or a variant thereof. 

In accordance with another aspect of the invention there is provided an isolated 
nucleic acid molecule encoding a mature polypeptide expressible by the Streptococcus 
20 pneumoniae 0 1 00993 strain contained in the deposited strain. 

A further aspect of the invention there are provided isolated nucleic acid molecules 
encoding aroA, particularly Streptococcus pneumoniae aroA. including mRNAs, cDNAs, 
genomic DNAs. Further embodiments of the invention include biologically, diagnostically, 
prophylactically, clinically or therapeutically useful variants thereof, and compositions 

25 comprising the same. 

In accordance with another aspect of the invention, there is provided the use of a 
polynucleotide of the invention for therapeutic or prophylactic purposes, in particular 
genetic immunization. Among the particularly preferred embodiments of the invention are 
naturally occurring allelic variants of aroA and polypeptides encoded thereby. 

30 Another aspect of the invention there are provided novel polypeptides of 

Streptococcus pneumoniae referred to herein as aroA as well as biologically, diagnostically, 
prophylactically, clinically or therapeutically useful vananls thereof, and compositions 
comprising the same. 
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Among the particularly preferred embodiments of the mvenlion are variants of aroA 
polypeptide encoded by naturally occumng alleles of the aroA gene. 

In a preferred embodiment of the invention there are provided methods for producing 
the aforementioned aroA polypeptides. 
5 In accordance v.ith yet another aspect of the invention, there are provided inhibitors 

to such polypeptides, useful as antibacterial agents, including, for example, antibodies. 

In accordance with cenain preferred embodiments of the invention, there are provided 
products, compositions and methods for assessing aroA expression, treating disease, for 
example, otitis media, conjunctivitis, pneumonia, bacteremia, meningitis, sinusitis, pleural 
10 empyema and endocarditis, and most particularly meningitis, such as for example infection of 
cerebrospinal fluid, assaying genetic variation, and administering a aroA polypeptide or 
polynucleotide to an organism to raise an immunological response against a bacteria, 
especially a Streptococcus pneumoniae bacteria. 

In accordance with certain preferred embodiments of this and other aspects of the 
15 invention there are provided polynucleotides that hybndize to aroA polynucleotide sequences, 
particularly under stringent conditions. 

In certain preferred embodiments of the invention there are provided antibodies 

against aroA polypeptides. 

In other embodiments of the invention there are provided methods for identifying 
20 compounds which bind to or otherwise interact with and inhibit or activate an activity of a 
polypeptide or polynucleotide of the invention compnsing: contacting a polypeptide or 
polynucleotide of the invention with a compound to be screened under conditions to permit 
binding to or other interaction between the compound and the polypeptide or polynucleotide 
to assess the binding to or other interaction with the compound, such binding or interaction 
25 being associated with a second component capable of providing a detectable signal in 
respLe to the binding or interaction of the polypeptide or polynucleotide with the 
compound; and determining whether the compound binds to or otherwise interacts with and 
activates or inhibits an activity of the polypeptide or polynucleotide by detecting the presence 
or absence of a signal generated from the binding or interaction of the compound with the 
30 polypeptide or polynucleotide. 

In accordance with yet another aspect of the invention, there are provided aroA 
agonists and antagonists, preferably bacteriostatic or bacteriocidal agonists and antagonists. 
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In a further aspect of the invention there are provided compositions compnsing a 
aroA polynucleotide or a aroA polypeptide for administration to a cell or to a multicellular 
organism. 

The invention provides an antagonist that inhibits or an agonist that activates an 
5 activity a polypeptide selected from the group consisting of: a polypeptide composing an 
amino acid sequence which is at least 90% identical to the amino acid sequence of SEQ ID 
NO:2 or 4, and a polypeptide comprising an amino acid sequence as set forth in SEQ ID NO:2 
or 4. 

In any of the methods herein using or referring to an activity of AroA or a polypeptide 
10 of the invention, said activity is selected from the group consisting of. synthesis of p- 
aminobenzoate. synthesis of ubiquinone, transformation of phospho(enol)pyruvate (PEP) and 
shikimate 3-phosphatc (S3P) to EPSP and inorganic phosphate (Pi), transformation of EPSP 
and inorganic phosphate (P.) to phosphoienoDpynivate (PEP) and .shikimate 3-phosphate 
(S3P), binding of AroA and phospho(enol)pyruvate, binding of AroA to 
15 phospho(enoDpyruvate-pyruvate kina.se complex. binding of AroA and 
phospho(enol)pymvate-laciacte dehydrogenase complex, binding of AroA and shikimate 3- 
phosphate. competitive inhibition of the forward reaction of AroA by glypho.sate (herein also 
"GLP") versus phospho(enol)pyruvate,uncompetitive inhibition of the forward reaction of 
AroA by glyphosate versus shikimate 3-phosphate. competitive inhibition of the forward 
20 reaction of AroA by EPSP versus phospho(enol)pymvate. competitive inhibition of the 
forward reaction of AroA by EPSP versus shikimate 3-phosphate, uncompetitive inhibition of 
the reverse reaction of AroA by glyphosate versus EPSP. noncompetitive inhibition of the 
reverse reaction of AroA by glyphosate versus Pi, competitive inhibition of the reverse 
reaction of AroA by shikimate 3-phosphate versus EPSP. and uncompetitive inhibition of the 
25 reverse reaction of AroA by shikimate 3-phosphate versus Pi. 

Further provided by the invention is a method for the treatment of an individual 
having need to inhibit or activate AroA polypeptide compnsing the steps of: administering to 
the individual a antibacterially effective amount of an antagonist that inhibits or an agonist 
that activates an activity of a polypeptide selected from the group consisting of: a polypeptide 
30 comprising an amino acid sequence which is at least 90% identical to the amino acid sequence 
of SEQ ID NO:2 or 4, and a polypeptide comprising an amino acid sequence as set forth in 
SEQ ID NO:2 or 4 

Also provided is a method for the treatment of an individual infected with a bacteria 
compnsing the steps of administenng to the individual a antibactenally effective amount of an 
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antagonist that inhibits or an agonist that activates an act.v.ty of a polypepi.de selected from 
the group consisting of: a polypeptide compnsmg an amino acid sequence which is at least 
90% identical to the amino acid sequence of SEQ ID NO:2 or 4. and a polypeptide composing 
an amino acid sequence as set forth in SEQ ID N0.2 or 4. 
5 In any of the methods of the ivention. it is preferred that a bactena is selected from 

the group consisting of a member of the genus Staphylococcus. Staphylococcus aureus, a 
member of the genus Streptococcus, and Streptococcus pneumoniae. 

Also provided by invention is a method for the treatment of an individual having need 
to inhibit or activate AroA polypeptide compnsing the steps of administering to the individual 
10 a antibacterially effective amount of an antagonist that inhibits or an agoinist that activates an 
activity of AroA. 

' The invention also provides a method for the treatment of an individual infected wuh 
a bactena comprising the steps of administering to the individual a antibactenally effective 
amount of an antagonist that inhibits or an agonist that activates that activates an activity of 
15 AroA. 

A method is also provided for the treatment of an individual infected by a bactena 
comprising the steps of administering to the individual an antibactenally effective amount of 
a compound that is a competitive inhibitor of shikimate-3-phosphate substrate utilization by 
AroA. m a preferred embodiment a compound of the invention, such as an antagonist or 
20 agonist, binds to or interacts with the active site of AroA or interacts with shikimate-3- 
phosphate or interferes with the binding of shikimate-3-phosphate. 

A method for the treatment of an individual infected by Streptococcus pneumomae 
comprismg the steps of administering to the individual a antibacterially effective amount of an 
antagonist that inhibits or anagonist that activates an activity of Streptococcus pneumoniae 
25 AroA is also provided by the invention . 

Further provided is an antagonist that inhibits an activity of a polypeptide selected 
from the group consisting of: a polypeptide comprising an amino acid sequence which is at 
least 90% identical to the amino acid sequence of SEQ ID NO:2 or 4. and a polypeptide 
comprising an amino acid sequence as set forth in SEQ ID NO:2 or 4. 
30 A method is further provided by the invention for the treatment of an individual 

having need to inhibit AroA polypeptide comprising the steps of administenng to the 
individual a antibacterially effective amount of an antagonist that inhibits an activity of a 
polypeptide selected from the group consisting of: a polypeptide compnsing an amino acid 
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sequence which is at least 907c identical to the amino acid sequence of SEQ ID NO:2 or 4. 
and a polypeptide composing an ammo acid sequence as set forth in SEQ ID NO:2 or 4. 

Still further provided is a method for inhibiting an activity of AroA polypeptide 
comprising the steps of contacting a composition comprising said polypeptide with an 
5 effective amount of an antagonist that inhibits an aciivbity of AroA. 

Another embodiment of the invention is a method for inhibiting a growth of bacteria 
comprising the steps of contacting a composition comprising bactena with an antibacterially 
effective amount of an antagonist that inhibits an activity of AroA. 

A still further embodiment of the invention is a a method for inhibiting a AroA 
10 polypeptide comprising the steps of contacting a composition comprising bacteria with an 
antibacterially effective amount of an antagonist that inhibits an activity of AroA. 

Various changes and modifications within the spirit and scope of the disclosed 
invention will become readily apparent to those skilled in the art from reading the following 
descriptions and from reading the other parts of the present disclosure. 
1 5 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. Inhibition of EPSP synthase by GLP in the forward reaction. Panel A: 
Competitive inhibition versus PEP determined at fixed S3P (1 mM); [GLP] = 0 (o), 0.5 ' M 
(•), 3 microMolar (n), and 6 microMolar (■). Panel B: Uncompetitive inhibition versus 
S3P detennined at fixed PEP ( 1 mM); [GLP] = 0 (o), 25 microMolar (•), 50 microMolar 
20 (□), and 100 microMolar (■). 

Figure 2. Inhibition of EPSP synthase by EPSP in the forward reaction. Panel A: 
Competitive inhibition versus PEP detemiined at fixed S3P (1 mM); [EPSP] = 0 (o), 50 
microMolar (•). 100 microMolar (□). 200 microMolar (■), and 400 microMolar ( . ). Panel 
B: Competitive inhibition versus S3P determined at fixed PEP (1 mM); [EPSP] = 0 (o), 25 
25 microMolar (•). 50 microMolar (□). 75 microMolar (■), and 100 microMolar (I ). 

Figure 3. Inhibition of EPSP synthase by GLP in the reverse reaction. Panel A: 
Mixed inhibition versus EPSP determined at fixed Pi (10 mM); [GLP] = 0 (o), 0.25 mM 
(•), 0.5 mM {□), 0.75 mM (■), and 1 mM ( ). Panel B: Mixed inhibition versus Pi 
determined at fixed EPSP (0.5 mM); [GLP] = 0 (c ). 0.1 mM (•), 0.5 mM (□). and 1 mM 
30 (■). 

Figure 4. Inhibition of EPSP synthase by S3P in the reverse reaction. Panel A: 
Competitive inhibition versus EPSP at fixed Pi (10 mM): [S3P] = 0 (o), 0.1 mM (•), 0.25 
mM (□). and 0,5 mM (■). Panel B: Uncompetitive inhibition versus Pi at fixed EPSP (0.5 
mM); [S3PJ = 0 (o). 0.1 mM (•). 0.25 mM (3). and 0.5 mM (■). 
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Figure 5 (Schemes 1 and 2) shows schemes for the EPSP synthase-catalyzed 
reaction and the formation of the PEP oxonium ion. 

Figure 6 (Scheme 3) shows a proposed mechanism for the forward reaction of 5, 

pneumoniae EPSP synthase. 
5 GLOSSARY 

The followmg definitions are provided to facilitate understanding of certain terms 
used frequently herein. 

"Bodily material(s) means any material derived from an individual or from an organism 
10 infecting, infesting or inhabiting an individual including but not limited to, cells, tissues and 
waste, such as, bone, blood, serum, cerebrospinal fluid, semen, saliva, muscle, cartilage, organ 
tissue, skin, urine, stool or autopsy materials. 

"Disease(s)" means any disease caused by or related to infection by a bacteria, 
including , for example, infections of the upper respirator>' tract (e.g., otitis media, bacterial 
15 tracheitis, acute epiglottitis, thyroiditis), lower respiratory (e.g., empyema, lung abscess), 
cardiac (e.g., infective endocarditis), gastrointestinal (e.g., secretory diarrhoea, splenic absces, 
retroperitoneal abscess), CNS (e.g., cerebral abscess), eye (e.g., blepharitis, conjunctivitis, 
keratitis, endophthalmitis, preseptal and orbital cellulitis, darcryocystitis), kidney and unnary 
tract (e.g. , epididymitis, intrarenal and perinephric absces, toxic shock syndrome), skin (e.g., 
20 impetigo, folliculitis, cutaneous abscesses, cellulitis, wound infection, bacterial myositis) bone 
and joint (e.g., septic arthritis, osteomyelitis). 

"Host cell(s)" is a cell that has been introduced {e.g., transformed or transfected) or is 
capable of introduction {e.g,, transformation or transfection) by an exogenous polynucleotide 
sequence. 

25 "Identity," as known in the art, is a relationship between two or more polypeptide 

sequences or two or more polynucleotide sequences, as the case may be, as determined by 
comparing the sequences. In the art, "identity" also means the degree of sequence relatedness 
between polypeptide or polynucleotide sequences, as the case may be, as determined by the 
match between strings of such sequences. "Identity" can be readily calculated by known 

30 methods, including but not limited to those described in (Computational Molecular Biology. 
Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics and 
Genome Projects, Smith, D.W., ed.. Academic Press, New York, 1993: Computer Analysis of 
Sequence Data. Part I, Griffin, A.M., and Griffin, H.G.. eds., Humana Press, New Jersey, 
1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and 
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Sequence Analysis Primer. Gnbskov, M. and Devereux, J., eds., M Stockton Press, New 
York, 1991; and Carillo, H., and Lipnnan. D., SIAM J. Applied Math., 48: 1073 (1988). 
Methods to determine identity are designed to give the largest match between the sequences 
tested. Moreover, methods to determine identity are codified in publicly available computer 

5 programs. Computer program methods to determine identity between two sequences include, 
but are not limited to. the GCG program package (Devereux, J., et al.. Nucleic Acids Research 
12(1): 387 (1984)), BLASTP, BLASTN, and FASTA (Altschul, S.F. et al., J. Molec. BioL 
215: 403-410 (1990). The BLAST X program is publicly available from NCBI and other 
sources (BLAST Manual, Altschul, S., et al.. NCBI NLM NIH Bethesda, MD 20894; 

10 Altschul, S.. et al. J. Mol BioL 215: 403-410 (1990). The well known Smith Waterman 
algorithm may also be used to determine identity. 

Parameters for polypeptide sequence comparison include the following: Algorithm: 
Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 

Comparison matrix: BLOSSUM62 from Hentikoff and Hentikoff, Proc, Natl. Acad. Sci. 
15 USA. 89:10915-10919(1992) 
Gap Penalty: 12 
Gap Length Penalty: 4 

A program useful with these parameters is publicly available as the "gap" program from 
Genetics Computer Group, Madison WL The aforementioned parameters are the default 

20 parameters for peptide comparisons (along with no penalty for end gaps). 

Parameters for polynucleotide comparison include the following: Algorithm: 
Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 
Comparison matrix: matches = +10, mismatch = 0 
Gap Penalty: 50 

25 Gap Length Penalty: 3 

Available as: The "gap" program from Genetics Computer Group. Madison WI. These are 
the default parameters for nucleic acid comparisons. 

A preferred meaning for "identity" for polynucleotides and polypeptides, as the case 
may be, are provided in ( 1) and (2) below. 

30 ( 1 ) Polynucleotide embodiments further include an isolated polynucleotide 

comprising a polynucleotide sequence having at least a 95, 97 or 100% identity to the 
reference sequence of SEQ ID NO:l OR 3, wherein said polynucleotide sequence may be 
identical to the reference sequence of SEQ ID NO: 1 OR 3 or may include up to a certain 
integer number of nucleotide alterations as compared to the reference sequence, wherein said 
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alterations are selected from the group consisting of at least one nucleotide deletion, 
substitution, including transition and transversion, or insertion, and wherein said alterations 
may occur at the 5' or 3' terminal positions of the reference nucleotide sequence or anywhere 
between those terminal positions, interspersed either individually among the nucleotides in 
5 the reference sequence or in one or more contiguous groups within the reference sequence, 
and wherein said number of nucleotide alterations is determined by multiplying the total 
number of nucleotides in SEQ ID NO: 1 OR 3 by the integer defining the percent identity 
divided by 100 and then subtracting that product from said total number of nucleotides in 
SEQ IDNO:l OR 3, or: 

10 

nn < Xn - (Xp • y), 

wherein Hj^ is the number of nucleotide alterations, x^ is the total number of nucleotides in 
SEQ ID NO:l OR 3, y is 0,95 for 95%, 0.97 for 97% or 1.00 for 1007c, and • is the symbol 

15 for the multiplication operator, and wherein any non-integer product of and y is rounded 
down to the nearest integer prior to subtracting it from x^^. Alterations of a polynucleotide 
sequence encoding the polypeptide of SEQ ID NO:2 OR 4 may create nonsense, missense or 
frameshift mutations in this coding sequence and thereby alter the polypeptide encoded by the 
polynucleotide following such alterations. 

20 (2) Polypeptide embodiments further include an isolated polypeptide comprising 

a polypeptide having at least a 95, 97 or 100% identity to a polypeptide reference sequence of 
SEQ ID NO:2 OR 4, wherein said polypeptide sequence may be identical to the reference 
sequence of SEQ ID NO:2 OR 4 or may include up to a cenain integer number of amino acid 
alterations as compared to the reference sequence, wherein said alterations are selected from 

25 the group consisting of at least one amino acid deletion, substitution, including conservative 
and non-conservative substitution, or insertion, and wherein said alterations may occur at the 
amino- or carboxy-terminal positions of the reference polypeptide sequence or anywhere 
between those terminal positions, interspersed either individually among the ammo acids in 
the reference sequence or in one or more contiguous groups within the reference sequence, 

30 and wherein said number of amino acid alterations is determined by multiplying the total 
number of amino acids in SEQ ID NO:2 OR 4 by the integer defining the percent identity 
divided by 100 and then subtracting that product from said total number of amino acids in 
SEQ ID NO:2 OR 4, or: 

-9- 



wo 00/68243 



PCT/USOO/12251 



Ha < Xa - (Xa • y), 

wherein is the number of amino acid alterations, x^ is the total number of amino acids in 
5 SEQ ID NO:2 OR 4, y is 0.95 for 95%, 0.97 for 977c or 1 .00 for 1007f , and • is the symbol 
for the multiplication operator, and wherein any non-integer product of Xa and y is rounded 
down to the nearest integer prior to subtracting it from x^- 

"Individual(s)" means a multicellular eukaryote, including, but not limited to a 
metazoan, a mammal, an ovid, a bovid, a simian, a primate, and a human. 

10 "Isolated" means altered "by the hand of man" from its natural state, i.e., if it occurs in 

nature, it has been changed or removed from its original environment, or both. For example, a 
polynucleotide or a polypeptide naturally present in a living organism is not '^isolated," but the 
same polynucleotide or polypeptide separated from the coexisting matenals of its natural state is 
"isolated", as the term is employed herein. Moreover, a polynucleotide or polypeptide that is 

15 introduced into an organism by transformation, genetic manipulation or by any other 
recombinant method is "isolated" even if it is still present in said organism, which organism 
may be living or non-living. 

"Organism(s)" means a (i) prokaryote, including but not limited to, a member of the 
genus Streptococcus, Staphylococcus, Bordetella, Cory nehacterium, Mycobacterium, Neisseria, 

20 Haemophilus, Actinomycetes, Streptomycetes, Nocardia, Enterohacter, Yersinia, Fancisella, 
Pasturella, Moraxella, Acinetobacter, Erysipelothrix, Branhamella, ActinobaciUus, 
Streptobacilhis, Listeria, Calymmatobacterium. Brucella. Bacillus, Clostridium, Treponema. 
Escherichia, Salmonella, Kleibsiella, Vibrio, Proteus, Erwinia, Borrelia, Leptospira, Spirillum, 
Campylobacter, Shigella, Legionella, Pseudomonas, Aeromonas, Rickettsia, Chlamydia, 

25 Borrelia and Mycoplasma, and further including, but not limited to, a member of the species or 
group. Group A Streptococcus, Group B Streptococcus, Group C Streptococcus, Group D 
Streptococcus, Group G Streptococcus, Streptococcus pneumoniae. Streptococcus pyogenes. 
Streptococcus agalactiae. Streptococcus faecalis. Streptococcus faecium. Streptococcus durans. 
Neisseria gonorrheal Neisseria meningitidis. Staphylococcus aureus. Staphylococcus 

30 epidermidis, Corynebacterium diptheriae, Gardnerella vaginalis, Mycobacterium tuberculosis, 
Mycobacterium bovis, Mycobacterium ulcerans, Mycobacterium leprae, Actinomyctes israelii. 
Listeria monocytogenes, Bordetella pertusis, Bordatella parapertusis, Bordetella 
bronchiseptica, Escherichia coli. Shigella dysenteriae, Haemophilus influenzae, Haemophilus 
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aegyptius, Haemophilus par a influenzae . Haemophilus ducreyi, Bordetella, Salmonella typhi, 
Citrobacter freundii, Proteus mirahilis, Proteus vulgaris, Yersinia pestis, Kleihsiella 
pneumoniae, Serratia murcessens, Serratia liquefaciens. Vibrio cholera. Shigella dysenterii. 
Shigella flexneri, Pseiuiomonas aeruginosa, Franscisella tularensis, Brucella abortis, Bacillus 
5 anthracis. Bacillus cereus, Clostridium perfringens, Clostridium tetani, Clostridium botulinum, 
Treponema pallidum, Rickettsia rickettsii and Chlamydia trachomitis, (ii) an archaeon, 
including but not linnited to Archaebacten and uii) a unicellular or filamentous eukaryote, 
including but not limited to, a protozoan, a fungus, a member of the genus Saccharomyces, 
Kluveromyces, or Candida, and a member of the species Saccharomyces ceriviseae, 
10 Kluveromyces lactis, or Candida albicans, 

"Bacieria(um)'" means a (i) prokaryote, including but not limited to, a member of the 
genus Streptococcus, Staphylococcus, Bordetella, Corynebacterium, Mycobacterium, Neisseria, 
Haemophilus, Actinomycetes, Streptomycetes. Nocardia, Enterobacter, Yersinia, Fancisella, 
Pasturella, Moraxella, Acinetobacter Erysipelothrix, Branhamella, Actinobacillus, 
15 Streptobacillus, Listeria, Calymmatobacteriunh Brucella, Bacillus, Clostridium, Treponema, 
Escherichia, Salmonella, Klcibsiella, Vibrio, Proteus, Envinia, Borrelia, Leptospira, Spirillum, 
Campylobacter, Shigella, Legionella, Pseudomonas, Aeromonas, Rickettsia, Chlamydia, 
Borrelia and Mycoplasma, and further including, but not limited to, a member of the species or 
group. Group A Streptococcus, Group B Streptococcus, Group C Streptococcus, Group D 
20 Streptococcus, Group G Streptococcus, Streptococcus pneumoniae. Streptococcus pyogenes. 
Streptococcus agalactiae. Streptococcus faecalis. Streptococcus faecium. Streptococcus durans. 
Neisseria gonorrheac. Neisseria meningitidis. Staphylococci, aureus. Staphylococcus 
epidermidis, Corynebacterium diptheriae, Gardnerella vaginalis, Mycobacterium tuberculosis, 
Mycobacterium bovis, Mycobacterium ulcerans, Mycobacterium leprae, Actinomyctes israelii, 
25 Listeria monocytogenes, Bordetella pertusis, Bordatella parapertusis, Bordetella 
bronchiseptica, Escherichia coli. Shigella dysenteriae, Haemophilus influenzae, Haemophilus 
aegyptius, Haemophilus parainfluenzae, Haemophilus ducreyi, Bordetella, Salmonella typhi, 
Citrobacter freundii, Proteus mirabilis, Proteus vulgaris, Yersinia pestis, Kleibsiella 
pneumoniae, Serratia marcessens, Serratia liquefaciens. Vibrio cholera. Shigella dysenterii, 
30 Shigella flexneri, Pseudomonas aeruginosa, Franscisella tularensis. Brucella abortis. Bacillus 
anthracis. Bacillus cereus, Clostridium perfringens, Clostridium tetani, Clostridium botulinum, 
Treponema pallidum, Rickettsia rickettsii and Chlamydia trachomitis, and (ii) an archaeon, 
including but not limited to Archaebacter. 
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"Polynucleotide(s)" generally refers to any polyribonucleotide or 
polydeoxyribonucleolide, that may be unmodified RNA or DNA or modified RNA or DNA. 
"Polynucleotide(s)" include, without limitation, single- and double-stranded DNA, DNA that is 
a mixture of single- and double-stranded regions or single-, double- and triple-stranded regions, 
5 single- and double-stranded RNA, and RNA that is mixture of single- and double-stranded 
regions, hybrid molecules comprising DNA and RNA that may be single-stranded or, more 
typically, double-stranded, or triple-stranded regions, or a mixture of single- and double- 
stranded regions. In addition, "polynucleotide" as used herein refers to tnple-stranded regions 
comprising RNA or DNA or both RNA and DNA. The strands in such regions may be from the 

10 same molecule or from different molecules. The regions may include all of one or more of the 
molecules, but more typically involve only a region of some of the molecules. One of the 
molecules of a triple-helical region often is an oligonucleotide. As used herein, the term 
"polynucleotide(s)" also includes DNAs or RNAs as described above that comprise one or more 
modified bases. Thus, DNAs or RNAs with backbones modified for stability or for other 

15 reasons are "polynucleotide(s)" as that term is intended herein. Moreover, DNAs or RNAs 
comprising unusual bases, such as inosine, or modified bases, such as tritylated bases, to name 
just two examples, are polynucleotides as the term is used herein. It will be appreciated that a 
great variety of modifications have been made to DNA and RNA that serve many useful 
purposes known to those of skill in the art. The term "polynucleotide(s)" as it is employed 

20 herein embraces such chemically, enzymatically or metaboHcally modified forms of 
polynucleotides, as well as the chemical forms of DNA and RNA characteristic of viruses and 
cells, including, for example, simple and complex cells. "Polynucleotide(s)" also embraces 
short polynucleotides often referred to as oligonucleotide(s)- 

"Polypeplide(s)" refers to any peptide or protein comprising two or more amino acids 

25 joined to each other by peptide bonds or modified peptide bonds. "Polypeptide(s)" refers to both 
short chains, commonly referred to as peptides, oligopeptides and oligomers and to longer 
chains generally referred to as proteins. Polypeptides may comprise amino acids other than the 
20 gene encoded amino acids. "Polypeptide(s)" include those modified either by natural 
processes, such as processing and other post-translational modifications, but also by chemical 

30 modification techniques. Such modifications are well described in basic texts and in more 
detailed monographs, as well as in a voluminous research literature, and they are well known to 
those of skill in the art. It will be appreciated that the same type of modification may be present 
in the same or varying degree at several sites in a given polypeptide. Also, a given polypeptide 
may comprise many types of modifications. Modifications can occur anywhere in a 
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polypeptide, including the peptide backbone, the amino acid side-chains, and the amino or 
carboxyl termini. Modifications include, for example, acetylation, acylation, ADP-ribosylaiion, 
amidalion, covalent attachment of flavin, covalenl attachment of a heme moiety, covaieni 
attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid 
5 derivative, covalent attachment of phosphotidylinositol, cross-linking, cyclization, disulfide 
bond formation, demethylaiion, formation of covalent cross-links, formation of cysteine, 
formation of pyroglutamate, formylation, gamma-carboxylation, GPl anchor formation, 
hydroxylation, iodination, methylation, myristoylation, oxidation, proteolytic processing, 
phosphorylation, prenylation, racemization, glycosylation, lipid attachment, sulfation, gamma- 

10 carboxylation of glutamic acid residues, hydroxylation and ADP-ribosylation, selenoylaiion, 
sulfation, transfer-RNA mediated addition of amino acids to proteins, such as arginylation, and 
ubiquitinaiion. See, for instance, PROTEINS - STRUCTURE AND MOLECULAR 
PROPERTIES. 2nd Ed., T. E. Creighion, W. H. Freeman and Company, New York (1993) and 
Wold, P., Posttranslational Protein Modifications: Perspectives and Prospects, pgs. 1-12 in 

15 POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS. B. C. Johnson, Ed., 
Academic Press, New York (1983); Seifter et aL, Meth, EnzymoL 752:626-646 (1990) and 
Rattan et ah. Protein Synthesis: Posttranslational Modifications and Aging, Ann. N.Y. Acad. 
Sci. 663: 48-62 (1992), Polypeptides may be branched or cyclic, with or without branching. 
Cyclic, branched and branched circular polypeptides may result from post-translational natural 

20 processes and may be made by entirely synthetic methods, as well. 

"Recombinant expression sysiem(s)" refers to expression systems or portions thereof 
or polynucleotides of the invention introduced or transformed into a host cell or host cell 
lysate for the production of the polynucleotides and polypeptides of the invention. 

"Variant(s)" as the term is used herein, is a polynucleotide or polypeptide that differs 

25 from a reference polynucleotide or polypeptide respectively, but retains essential properties. 
A typical variant of a polynucleotide differs in nucleotide sequence from another, reference 
polynucleotide. Changes in the nucleotide sequence of the variant may or may not alter the 
amino acid sequence of a polypeptide encoded by the reference polynucleotide. Nucleotide 
changes may result in amino acid substitutions, additions, deletions, fusion proteins and 

30 truncations in the polypeptide encoded by the reference sequence, as discussed below . A 
typical variant of a polypeptide differs in amino acid sequence from another, reference 
polypeptide. Generally, differences are limited so thai the sequences of the reference 
polypeptide and the variant are closely similar overall and, in many regions, identical. A 
variant and reference polypeptide may differ in amino acid sequence by one or more 
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substitutions, additions, deletions in any combination. A substituted or inserted ammo acid 
residue may or may not be one encoded by the genetic code. Tlie present invention also 
includes include vanants of each of the polypeptides of the invention, that is polypeptides that 
vary from the referents by conservative amino acid substitutions, whereby a residue is 
5 substituted by another with like characteristics. Typical such substitutions are among Ala, Val. 
Leu and lie: among Ser and Thr; among the acidic residues Asp and Glu: among Asn and Gin: 
and among the basic residues Lys and Arg: or aromatic residues Phe and Tyr. Particularly 
preferred are variants in which several, 5-10, 1-5, 1-3, 1-2 or 1 amino acids are substituted, 
deleted, or added in any combination. A variant of a polynucleotide or polypeptide may be a 
10 naturally occurring such as an allelic variant, or it may be a variant that is not known to occur 
naturally. Non-naturally occurring variants of polynucleotides and polypeptides may be 
made by mutagenesis techniques, by direct synthesis, and by other recombinant methods 
known to skilled artisans. 



The invention relates to novel aroA polypeptides and polynucleotides as described in 
greater detail below. In particular, the invention relates to polypeptides and polynucleotides 
of a novel aroA of Streptococcus pneumoniae, which is related by amino acid sequence 
20 homology to Lactococcus laciis 5-enolpyruvoylshikimaie -3-phosphaie synthase (aro A) 
Percent Similarity: 79.673, Percent Identity: 68.224 polypeptide. The invention relates 
especially to aroA having the nucleotide and amino acid sequences set out in Table 1 [SEQ ID 
NO: 1] and Table 1 [SEQ ID NO: 2] respectively, and to the aroA nucleotide sequences of the 
DNA in the deposited strain and amino acid sequences encoded thereby. 



aroA Polynucleotide and Polypeptide Sequences 

(A) Sequences from Streptococcus pneumoniae aroA polynucleotide sequence [SEQ ID 
30 NO: I]. 

5 ' -ATGAAACTAA AAACAAACAT TCGCCATTTA CATGGTATTA TCCGCGTCCCAGGTGACAAG 
TCTATCAGCC ACCGTTCCAT TA.TCTTTGGA AGTTTGGCTG AGGGTGAGAC CAAGGTTTAT 
GATATTCTGC GA.GGTGAAGA CGTTCTTTCGACCATGCAGG TTTTTCGTGA CCTTGGTGTT 
GAAATTGAGG ATAAAGATGG GGTTATTACC GTTCAAGGTG TAGGCATGGC TGGCTTAAAA. 
35 GCGCCGCAAA ATGCCCTTAA TATGGGAAAT TrTGGCACCT CGATTCGCCT GATTTCAGGT 
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TABLE 1 



- 14- 



PCT/USOO/12251 
WO 00/68243 

GTnCTTGCTG GTGCAGATTT CGAAGTAGAG ATGTTTGGAG ATGATAGTCT TTCCAAACGT 
CCTATGGACC GTGTGACCCT TCCACTGAAA. AAAATGGGCG TCAGCATCTC AGGGCAAACT 
GAACGAGACT TGCCTCCCCT TCGCTTAAAA 

GGGACGAAAA ACCTAAGACC TA^TCATTAT GAGTTGCCAA TTGCCTCTGC CCAAGTCAAG 
5 TCAGCCTTGA TGTTTGCAGC CTTACAAGCT AAGGGGGAGTCAGTTATTAT CGAAAAAGAG 
TACACCCG-A ATCATACTGA AGATATGTTG CAACAATTTG GTGGTCATTT AAGTGTGGAT 
GGTAAGAAAA TCACAGTCCA AGGCCCACAA A.AATTGACAG GACAGAAGGT GGTCGTACCA 
GGAGATATTT CCAGTGCAGC CTTTTGGTTA GTCGCAGGTT TGATTGCTCC AAA.TTCTCGT 
CTAGTGCTGC AGAATGTGGG GATAAACGAA ACTCGCACCG GTATTATTGATGTCATTCGT 
10 GCCATGGGTG GAAAATTGGA AATAACTGAA ATCGATCCAG TCGCTAAATC TGCAACCTTG 
ATTGTTGAGT CTTCTGACTT GAAAGGAACA GAGATTTG7G GCGCTTTGAT TCCACGTTTG 
ATTGATGAAT TGCCTATTAT TGCCCTACTT GCGACCCAAG CCCAAGGTGT A.ACAGTTATC 
AAGGATGCTG AGGAGCTCAA GGTCAAGGAA ACAGACCGTA TTCAGGTTGT GGCAGACGCC 
-TAAATAGTA TGGGAGCAGA TATTACTCCT ACGGCAGATG GGATGATTAT CAAAGGAAAA 
IS T-AGCT--T.- ACGGTGCTAG AGTCAATACG TTTGGTGACC ACCGTATCGG CATGATGACA 
GrTATCGCAG CCCTATTGGT TGCAGATGGAGAGGTGGAGG TTGACCGTGC AGA.AGCCATC 
AATACCAGCT ATCCTAGTTT CTTTGATGAT TTGGAGAGCT TGATTCATGG CTAA-3' 

(B) aroA polypeptide sequence deduced from the polynucleotide sequence .n thus table 
20 ISEQ ID NO:2]. 

NH -MKLKTNIRHL HGIIRVPGDK SISHRSIIFG 3LAEGETKV.' DILRGEDVL3TMQVFRDLGV 
EIEDKDGVIT VQGVGMAGLK APONALNMGN SGTSIRLISGVLAGADFEVE MFGDDSLSKR 
PMDRVTLPLK KMGVSISGQT ERDLPPLRLK GTKNLRPIHY ELPIASAQVK SALMFAALQA 
FGESVIIEKE YTRNHTEDMLQQFGGHLSVD GKKITVQGPr. KLTGQKW^rp GDISSAAFWL 
25 VAGLIAPNSRLVLQNVGINE TRTGIIDVIR AMGGPILEITE IDPVAKSATL IVES3DLKGT 
EICGALIPRL IDELPIIALL ATQAQGVTVI J^DAEELKVKE TDRIQWADA LNSMGADITP 
TADGMIIKGK SALHGARVNT FGDHRIGMMT AIAALLVADGEVELDRAEAT NTSYPSFFDD 
LESLIHG-COOH 

30 (C) Polynucleotide sequence embodiments [SEQ ID NO: 1 ]. 

X-{Ri) -ATGAAACTAA AAACAAACAT TCGCCATTTA CATGGTATTA 

TCCGCG?CCCAGGTGACAAG TCTATCAGCC ACCGTTCCAT TATCTTTGGA AGTTTGGCTG 
AGGGTGAGAC CAAGGTTTAT GATATTCTGC GAGGTGA/.GA CGTTCTTTCGACCATGCAGG 
TTTTTCGTGA CCTTGGTGTT GAAATTGAGG ATAAAGATGG GGTTATTACC GTTCAAGGTG 
35 TAGGCATGGC TGGCTTAAAA GCGCCGCAA,A ATGCCCTTA.. TATGGGAAA.T TCTGGCACCT 
CGATTCGCCT GATTTCAGGT GTCCTTGCTG GTGCAGATTT CGAAGTAGAG ATGTTTGGAG 
ATGATAGTCT TTCCAAACGT CCTATGGACC GTGTGACCCT TCCACTGAAA AAAATGGGCG 
TCAGCATCTC AGGGCAAACT GAACGAGACT TGCCTCCCCT TCGCTTAAAA 
GGGACGAAAA ACCTAAGACC TATTCATTAT GAGTTGCCAA TTGCCTCTGC CCA.AGTCAAG 
40 TCAGCCTTGA TGTTTGCAGC CTTACAAGCT AAGGGGGAGTCAGTTATTAT CGAAAAAGAG 
TACACCCGTA ATCATACTGA AGATATGTTG CAACAATTTG GTGGTCATTT AAGTGTGGAT 
GGTAAGAAAA TCACAGTCCA AGGGCCACAA AA^.TTGACAG GACAGA.AGGT GGTCGTACCA 
GGAGATATTT CCAGTGCAGC CTTTTGGTTA GTCGCAGGTT TGATTGCTCC AAATTCTCGT 
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CTAGTGCTGC AGAATGTGGG GATAAACGAA ACTCGCACCG GTATTATTGATGTCATTCGT 
GCCATGGGTG GAAAATTGGA AATAACTGAA ATCGATCCAG TCGCTAAATC TGCAACCTTG 
ATTGTTGAGT CTTCTGACTT GAAAGGAACA GAGATTTGTG GCGCTTTGAT T 
CCACGTTTG ATTGATGAAT TGCCTATTAT TGCCCTACTT GCGACCCAAG CCCAAGGTGT 
5 AACAGTTATC AJ^GGATGCTG AGGAGCTCAA GGTCAAGGAA ACAGACCGTA TTCAGGTTGT 
GGCAGACGCC TTAAATAGTA TGGGAGCAGA TATTACTCCT ACGGCAGATG GGATGATTAT 
CAAAGGAAAA TCAGCTCTTC ACGGTGCTAG AGTCAATACG TTTGGTGACC ACCGTATCGG 
CATGATGACA GCTATCGCAG CCCTATTGGT TGCAGATGGAGAGGTGGAGC TTGACCGTGC 
AGAAGCCATC AATACCAGCT ATCCTAGTTT CTTTGATGAT TTGGAGAGCT TGATTCATGG 
10 CTAA- (R2)n-'^ 
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(D) Polypeptide sequence embodimenls fSEQ ID NO:2]. 

X- (R^) n-l^^'-^^-™IRHL HGIIRVPGDK SISHRSIIFG SLAEGETKVY 
DILRGEDVLSTMQVFRDLGV EIEDKDGVIT VQGVGMAGLK APQNALNMGN 
SGTSIRLISGVLAGADFEVE MFGDDSLSKR PMDRVTLPLK KMGVSISGQT ERDLPPLRLf: 

5 GTKJ^LRPIHy ELPIASAQVPC SALMFAALQA KGESVIIEKE YTPJ^HTEDMLQQFGGHLSVD 
GKKITVQGPQ KLTGQKVWP GDISSAAFV/L VAGLI APNSRLVLQWGINE TRTGIIDVIR 
AI^GGKLEITE IDPVAKSATL IVESSDLKGT EICGALIPRL IDELPIIALL ATQAQGVTVI 
KDAEELKVKE TDRIQWADA LIISMGADITP TADGMITKGK SALHGARVNT FGDHRIGMKT 
AIAALLVADGEVELDRAEAI NTSYPSFFDD LESLIHG 
10 -(R2)n-Y 

(E) Sequences from Streptococcus pneumoniae aroA polynucleotide ORF sequence 
[SEQ ID NO:3]. 

5 ' -AGCTTGATCG TCCCAGGTGA CAAGTCTATC AGCCACCGTT CCATTATCTT 
15 TGGAAGTTTG GCTGAGGGTG AGACCAAGGT TTATGATA.TT CTGCGAGGTG AACACGTTCT 
TTCGACCATG CAGGTTTTTC GTGACCTTGG TGTTGAAATT GAGGATAAAG ATGGGGTTAT 
TACCGTTCAA GGTGTAGGCA TGGCTGGCTT AAAAGCGCCG CAAAATGCCC TTAATATGGG 
AAATTGTGGC ACCTCGATTG 

GCCTGATTTC AGGTGTGCTT GCTGGTGCAG ATTTCGAAGT AGAGATGTTT GGAGA.TGATA 

20 GTCTTTCCAA ACGTCCTATG GACCGTGTGA CCCTTCCACT 

GAAAAAAATG GGCGTCAGCA TCTCAGGGCA AACTGAACGA GACTTGCCTC 
CCCTTCGCTT TAAAAGGGAC G.AAAAAGGTA AGACGTATtc attatgagtt gccaatcgcc 
tctgcccaag tcaagticagc cnnnnnnnnn nnnnnnnnn CTAAGGGG GAGTCAGTTA 
TTATCGAAAA AGAGTACACC CGTAATCATA CTGAAGATAT GTTGCAACAA TTTGGTGGTC 

25 ATTTAAGTGT GGATGGTAAGAAAATCACAG TCCAAGGGCC ACAAAAATTG ACAGGACAGA 
AGGTGGTCGT ACCAGGAGAT ATTTGCAGTG CAGCCTTTTG GTTAGTCGCA GGTTTGATTG 
CTCCAAATTC TCGTCTAGTG CTGCAGAATG TGGGGATAAA CGAAACTCGC ACCGGTATTA 
TTGATGTCAT TCGTGCCATG GGTGGAAAAT TGGAAATAACTGAAATCGAT CCAGTCGCTA 
AATCTGCAAC CTTGATTGTT GAGTCTTCTGACTTGAAAGG AACAGAGATT TGTGGCGCTT 

30 TGATTCCACG TTTGATTGAT 

GAATTGCCTA TTATTGCCCT ACTTGCGACC CAAGCCCAAG GTGTAACAGT TATCAAGGAT 
GCTGAGGAGC TCAAGGTCAA GGAAACAGAC CGTATTCAGG TTGTGGCAGA CGGGTTAAAT 
AGTATGGGAG CAGATATTAC TCCTACGGCA GATGGGATGA TTATCAAAGG AAAATCAGCT 
CTTCACGGTG CTAGAGTCAATACGTTTGGT GACCACCGTA TCGGCATG7\T GACAGCTATC 

35 GCAGCCCTAT TGGTTGCAGA TGGAGAGGTG GAGCTTGACC GTGCAGAAGC CATCAATACC 
AGCTATCCTA GTTTCTTTGA TGATTTGGAG AGCTTGATTC ATGGCTAA 



(F) 



aroA polypeptide sequence deduced from the polynucleotide ORF sequence in this 



40 



table [SEQ ID NO:4]. 
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NH -SLIVPGDKSI SHRSIIFGSL AEGETK\^DI LRGEHVLSTM QVFRDLGVEIEDKDGVITVQ 

GVGMAGLKAP QNALNMGNSG TSIRLISGVL AGADFEVEMF GDDSLSKRPK DRVTLPLKt^LK 
GVSISGQTER DLPPL.RFKRD EKPKTYSLXV 

ANCLCPSQVS XXXXXXXXKG ESVIIEKEYT RNHTEDHLQQ FGGHLS\^GK KITVQGPQKL 

5 TGQKVWPGD ISSAAFWLVA GLIAPNSRLV LOK-VGINETR TGIIDVIRAI-I GGKLEITEID 

PVAKSATLIV ESSDLKGTEI CGALIPRLID ELPIIALLAT QAQGVTVIKD AEELKVKETD 

RIQWADALN SMGADITPTA DGMIIKGKSA LHGARVIJTFG DHRIGiyn>!TAI AALLVADGEV 
ELDRAEAINTSYPSFFDDLE SLIHG-COOH 



10 _ 

Deposited materials 

A deposit containing a Streptococcus pneumoniae 0100993 strain has been deposited 
with the National Collections of Industrial and Manne Bacteria Ltd. (herein "NCIMB"), 23 St. 
Machar Drive, Aberdeen AB2 IRY, Scotland on 1 1 April 1996 and assigned deposit number 
15 40794. The deposit was described as Streptococcus peumnoiae 0100993 on deposit. On 17 
April 1996 a Streptococcus peumnoiae 0100993 DNA librar>^ in E, coli was similarly 
depositedwith the NCIMB and assigned deposit number 40800.. The Streptococcus 
pneumoniae strain deposit is referred to herein as "the deposited strain*' or as "the DNA of the 
deposited strain." 

20 The deposited strain contains the full length aroA gene. The sequence of the 

polynucleotides contained in the deposited strain, as well as the amino acid sequence of the 
polypeptide encoded thereby, are controlling in the event of any conflict with any description 
of sequences herein. 

The deposit of the deposited strain has been made under the terms of the Budapest 
25 Treaty on the International Recognition of the Deposit of Micro-organisms for Purposes of 
Patent Procedure. The strain will be irrevocably and without restriction or condition released 
to the public upon the issuance of a patent. The deposited strain is provided merely as 
convenience to those of skill in the art and is not an admission that a deposit is required for 
enablement, such as that required under 35 U.S.C. §112. 
30 A license may be required to make, use or sell the deposited strain, and compounds 

derived therefrom, and no such license is hereby granted. 
Polypeptides 

The polypeptides of the invention include the polypeptide of Table 1 [SEQ ID NO:2] 
(in particular the mature polypeptide) as well as polypeptides and fragments, paniculariy those 
35 which have the biological activity of aroA. and also those which have at least 707c identity to 
a polypeptide of Table 1 [SEQ ID NOS:2 and 4] or the relevant portion, preferably at least 
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80% identity to a polypeptide of Table 1 [SEQ ID NOS:2 and 4], and more preferably at least 
909c similarity (more preferably at least 909c identity) to a polypeptide of Table 1 [SEQ ID 
NOS:2 and 4] and still more preferably at least 95% similanly (still more preferably at least 
957c identity) to a polypeptide of Table 1 [SEQ ID NOS:2 and 4] and also include portions of 
5 such polypeptides with such portion of the polypeptide generally containing at least 30 amino 
acids and more preferably at least 50 amino acids. 

The invention also includes polypeptides of the formula set forth in Table 1 (D) [SEQ 
ID NO:2] wherein, at the amino terminus, X is hydrogen, and at the carboxyl terminus, Y is 
hydrogen or a metal, R] and R2 is any amino acid residue, and n is an integer between 1 and 
10 1000. Any stretch of amino acid residues denoted by either R group, where R is greater than 
1, may be either a heteropolymer or a homopolymer, preferably a heieropolymer, 

A fragment is a variant polypeptide having an amino acid sequence that entirely is the 
same as part but not all of the amino acid sequence of the aforementioned polypeptides. As 
with aroA polypeptides fragments may be "free-standing," or compnsed within a larger 
15 polypeptide of which they form a part or region, most preferably as a single continuous region, 
a single larger polypeptide. 

Preferred fragments include, for example, truncation polypeptides having a portion of 
an amino acid sequence of Table 1 [SEQ JD NOS:2 and 4], or of variants thereof, such as a 
continuous series of residues that includes the amino terminus, or a continuous series of 
20 residues that includes the carboxyl terminus. Degradation forms of the polypeptides of the 
invention in a host cell, particularly a Streptococcus pneumoniae, are also preferred. Further 
preferred are fragments characterized by structural or functional attributes such as fragments 
that comprise alpha-helix and alpha-helix forming regions, beta-si leet and beta-sheet-forming 
regions, turn and turn-forming regions, coil and coil-forrmng regions, hydrophilic regions, 
25 hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible regions, 
surface-forming regions, substrate binding region, and high antigenic index regions. 

Also preferred are biologically active fragments which are those fragments that 
mediate activities of aroA, including those with a similar activity or an improved activity, or 
with a decreased undesirable activity. Also included are those fragments that are antigenic or 
30 immunogenic in an animal, especially in a human. Particularly preferred are fragments 
comprising receptors or domains of enzymes that confer a function essential for viability of 
Streptococcus pneumoniae or the ability to initiate, or maintain cause disease in an individual, 
particularly a human. 
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"X" or "Xaa" may also be used in describing certain polypeptides of the invention. 
"X" and "Xaa" mean thai any of the twenty naturally occurring amino acids may appear at 
such a designated position in the polypeptide sequence. 

Variants that are fragments of the polypeptides of the invention may be employed for 
5 producing the corresponding full-length polypeptide by peptide synthesis; therefore, these 
variants may be employed as intermediates for producing the full-length polypeptides of the 
invention. 

Polynucleotides 

Another aspect of the invention relates to isolated polynucleotides, including the full 

10 length gene, that encode the aroA polypeptide having a deduced amino acid sequence of Table 
I [SEQ ID NOS:2 and 4] and polynucleotides closely related thereto and variants thereof. 

Using the information provided herein, such as a polynucleotide sequence set out in 
Table 1 [SEQ ID NOSil and 3], a polynucleotide of the invention encoding aroA polypeptide 
may be obtained using standard cloning and screening methods, such as those for cloning and 

15 sequencing chromosomal DNA fragments from bacteria using Streptococcus pneumoniae 
0100993 cells as starting maienal, followed by obtaining a full length clone. For example, to 
obtain a polynucleotide sequence of the invention, such as a sequence given in Table 1 
[SEQ ID NOS:l and 3], typically a library of clones of chromosomal DNA of 
Streptococcus pneumoniae 0100993 in E.coli or some other suitable host is probed with a 

20 radiolabeled oligonucleotide, preferably a 17-mer or longer, derived from a partial 
sequence. Clones carrying DNA identical to that of the probe can then be distinguished 
using stringent conditions. By sequencing the individual clones thus identified with 
sequencing primers designed from the original sequence it is then possible to extend the 
sequence in both directions to determine the full gene sequence. Conveniently, such 

25 sequencing is performed using denatured double stranded DNA prepared from a plasmid 
clone. Suitable techniques are described by Maniatis, T., Fritsch. E.F. and Sambrook et al., 
MOLECULAR CLONING. A LABORATORY MANUAL. 2nd Ed.; Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (1989). (see in particular Screening By 
Hybridization 1.90 and Sequencing Denatured Double-Stranded DNA Templates 13,70). 

30 Illustrative of the invention, the polynucleotide set out in Table 1 [SEQ ID NOT] was 
discovered in a DNA library derived from Streptococcus pneumoniae 0100993. 

The DNA sequence set out in Table 1 [SEQ ID NOS:l] contains an open reading 
frame encoding a protein having about the number of amino acid residues set fonh in Table 1 
[SEQ ID NO:2] with a deduced molecular weight that can be calculated using amino acid 
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residue molecular weight values well known in the art. The polynucleotide of SEQ ID NO: 1, 
between nucleotide number 1 through number 1281 encodes the polypeptide of" SEQ ID 
NO:2. The stop codon begins at nucleotide number 1284 of SEQ ID NO:l-l 1281 

The aroA protem of the invention is structurally related to other proteins of the aro (5- 
5 enolpyruvylshikimate-3-phosphate synthase) family, as shown by the results of sequencing the 
DNA encoding aro A of the deposited strain. The protein exhibits greatest homology to 
Lactococcus lactis 5-enolpyruvoylshikimate -3-phosphate synthase (aro A) (showing a percent 
similarity of 79.7% and a percent identity of 68.2%) protem among known proteins. Other 
related sequences are: 



10 


Acc. No. 


Species 




X78413 


LAactis aro A 




M80245 


B.subtilis aroA 




Z29339 


DModosus aro A 




119982 


Sequence 14 from patent US 5.512,466 


15 


L05004 


Staphylococcus aureus aro A 




X75325 


Synechocystis sp. aro A 




X77019 


P.pseudomallei aro A 




D90914 


Synechocystis sp. aro A 




X89371 


C. jejuni aro A 


20 


L46372 


Yersinia pestis aroA 




Related sequences are described in Sharps. Analytical Biochem 140: 183-189, 1984; 



Majumder et al., Eur. J. Biochem 229: 99-106, 1995; OXIonnell et al., J. Gen Microbiol. 
139: 1449-1460, 1993; Oyston et al., "Immunization with live recombinant Salmonelle 
typhimurium aroA producing Fl antigen protects against plague," 1995; and Rogers ei al.. 



25 ^'Amplification of the aroA gene from Escherichia coli results in tolerance to the herbicide 
glyphosphaie," 1983. 

The invention provides a polynucleotide sequence identical over its entire length to 
the coding sequence m Table I [SEQ ID NO:l or SEQ ID NO:2]. Also provided by the 
invention is the coding sequence for the mature polypeptide or a fragment thereof, by itself as 

30 well as the coding sequence for the mature polypeptide or a fragment in reading frame with 
other coding sequence, such as those encoding a leader or secretor\' sequence, a pre-, or pro- 
or prepro- protein sequence. The polynucleotide may also contain non-coding sequences, 
including for example, but not limited to non-coding 5' and 3' sequences, such as the 
transcribed, n on -translated sequences, termination signals, ribosome binding sites, sequences 

35 that stabilize mRNA, introns, polyadenylation signals, and additional coding sequence which 
encode additional amino acids. For example, a marker sequence that facilitates purification 
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of the fused polypeptide can be encoded, in certain embodiments of the invention, the marker 
sequence is a hexa-histidine peptide, as provided in the pQE vector (Qiagen, Inc.) and 
described in Gentz et aL. Proc. Natl. Acad. Scl, USA 86: 821-824 (1989), or an HA tag 
(Wilson et uL. Cell 37: 161 (1984). Polynucleotides of the invention also include, but are not 
5 limited to, polynucleotides compnsmg a structural gene and its naturally associated sequences 
that control gene expression. 

A preferred embodiment of the invention is a polynucleotide of composing 
nucleotide 1 to 1281 or 1284 set forth m SEQ ID NO:l of Table 1 which encode the aroA 
polypeptide. 

10 The invention also includes polynucleotides of the formula set forth in Table 1 

(C)[SEQ ID NO: I] u^herein, at the 5' end of the molecule, X is hydrogen, and at the 3' end of 
the molecule, Y is hydrogen or a metal, R] and R2 is any nucleic acid residue, and n is an 
integer between 1 and 1000. Any stretch of nucleic acid residues denoted by either R group, 
where R is greater than 1, may be either a heteropolymer or a homopolymer, preferably a 
15 heteropolymer. 

The term "polynucleotide encoding a polypeptide" as used herein encompasses 
polynucleotides that include a sequence encoding a polypeptide of the invention, particularly a 
bacterial polypeptide and more particularly a polypeptide of the Streptococcus pneumoniae 
aroA having the amino acid sequence set out in Table 1 [SEQ ID NO:2]. The term also 
20 encompasses polynucleotides that include a single continuous region or discontinuous regions 
encoding the polypeptide (for example, interrupted by integrated phage or an insertion 
sequence or editing) together with additional regions, that also may contain coding and/or 
non-coding sequences. 

The invention further relates to vanants of the polynucleotides described herein that 
25 encode for variants of the polypeptide having the deduced amino acid sequence of Table 1 
[SEQ ID NO:2]. Variants that are fragments of the polynucleotides of the invention may be 
used to synthesize fulMength polynucleotides of the invention- 
Further particularly preferred embodiments are polynucleotides encoding aroA 
vanants, that have the amino acid sequence of aroA polypeptide of Table 1 [SEQ ID NO:2] in 
30 which several, a few, 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues are substituted, 
deleted or added, in any combination. Especially preferred among these are silent 
substimtions, additions and deletions, that do not alter the properties and activities of aroA. 

Further preferred embodiments of the invention are polynucleotides that are at least 
709c identical over their entire length to a polynucleotide encoding aroA polypeptide having 
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an amino acid sequence set out in Table 1 [SEQ ID NOS:2 and 4), and polynucleotides that 
are complementary to such polynucleotides. Alternatively, most highly preferred are 
polynucleotides that comprise a region that is at least 80% identical over its entire length to a 
polynucleotide encoding aroA polypeptide of the deposited strain and polynucleotides 

5 complementary thereto. In this regard, polynucleotides at least 90% identical over their entire 
length to the same are particularly preferred, and among these particularly preferred 
polynucleotides, those with at least 95% are especially preferred. Furthermore, those with at 
least 97%; are highly preferred among those with at least 95%. and among these those with at 
least 98% and at least 99%; are particularly highly preferred, with at least 99% bemg the more 

10 preferred. 

Preferred embodiments are polynucleotides thai encode polypeptides that retain 
substantially the same biological function or activity as the mature polypeptide encoded by the 
DNA of Table 1 [SEQ ID NO: 1]. 

The invention further relates to polynucleotides that hybndize to the herein above- 
15 described sequences. In this regard, the invention especially relates to polynucleotides that 
hybridize under stringent conditions to the herein above-descnbed polynucleotides. As herein 
used, the terms "stringent conditions" and "stringent hybndization conditions" mean 
hybridization will occur only if there is at least 95% and preferably at least 97% identity 
between the sequences. An example of stringent hybridization conditions is overnight 
20 incubation at 42°C in a solution comprising: 50% formamide. 5x SSC (150mM NaCI, 
15mM tri.sodium citrate), 50 mM sodium phosphate (pH7.6), 5x Denhardt's solution, 10% 
dextran sulfate, and 20 micrograms/ml denatured, sheared salmon <vcnn DNA, followed by 
washing the hybridization support in O.lx SSC at about 65°C. Hybridization and wash 
conditions are well known and exemplified in Sambrook. ei al.. Molecular Cloning: A 
25 Laboratory Manual, Second Edition, Cold Spring Harbor. N.Y., (1989), particularly 
Chapter 1 1 therein. 

The invention also provides a polynucleotide consisting essentially of a 
polynucleotide sequence obtainable by screening an appropriate library containing the 
complete gene for a polynucleotide sequence set forth in SEQ ID NO:l or SEQ ID NO:3 
30 under stringent hybridization conditions with a probe having the sequence of said 
polynucleotide sequence set forth in SEQ ID NO: I or a fragment thereof; and isolating said 
DNA sequence. Fragments useful for obtaining such a polynucleotide include, for example, 
probes and primers described elsewhere herein. 
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As discussed additionally herein regarding polynucleotide assays of the invention, for 
instance, polynucleotides of the invention as discussed above, may be used as a hybridization 
probe for RNA, cDNA and genomic DNA to isolate full-length cDNAs and genomic clones 
encoding aroA and to isolate cDNA and genomic clones of other genes that have a high 
5 sequence similarity to the aroA gene. Such probes generally w.U compr.se at least 15 bases. 
Preferably, such probes will have at least 30 bases and may have at least 50 bases. 
Particularly preferred probes will have at least 30 bases and will have 50 bases or less. 

For example, the coding region of the aroA gene may be isolated by screening using 
the DNA sequence provided in SEQ ID NO: 1 to synthesize an oligonucleotide probe. A 
1 0 labeled oligonucleotide having a sequence complementary to that of a gene of the invention is 
then used to screen a library- of cDNA. genomic DNA or mRNA to determine which members 
of the library the probe hybridizes to. 

The polynucleotides and polypeptides of the invention may be employed, for 
example, as research reagents and materials for duscovery of treatments of and diagnostics for 
15 disease, particularly human disease, as further discussed herein relating to polynucleotide 
assays. 

Polynucleotides of the invention that are oligonucleotides derived from the 
.sequences of SEQ ID NOS:l and/or 2 may be used in the processes herein as described, but 
preferably for PGR, to determine whether or not the polynucleotides identified herein in 
20 whole or in part are transcribed in bacteria in infected tissue. It is recognized that .such 
sequences will also have utility in diagnosis of the stage of infection and type of infection 

the pathogen has attained. 

The invention also provides polynucleotides that may encode a polypeptide that is the 
matui^ protein plus additional amino or carboxyl-ierm.nal amino acids, or amino acids interior 
25 to the mature polypeptide (when the mature form has more than one polypeptide chain, for 
instance). Such sequences may play a role m processing of a protein from precursor to a 
mature form, may allow protein transport, may lengthen or shorten protein half-life or may 
facilitate manipulation of a protein for assay or production, among other things. As generally 
is the case in viva, the additional amino acids may be processed away from the mature protein 

30 by cellular enzymes. 

A precursor protein, having the mature form of the polypeptide fused to one or more 
prosequences may be an inactive form of the polypeptide. When prosequences are removed 
such inactive precursors generally are activated. Some or all of the prosequences may be 
removed before activation. Generally, such precursors are called proproteins. 

-24- 
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In addition to the standard A, G. C. T/U representations for nucleotides, the term "N" 
may also be used in describing cenam polynucleotides of the invention. "N" means that any 
of the four DNA or RNA nucleotides may appear at such a designated position in the DNA or 
RNA sequence, except it is preferred that N is not a nucleic acid that when taken in 
5 combination with adjacent nucleotide positions, when read in the correct reading frame, 
would have the effect of generating a premature termination codon in such reading frame. 

In sum, a polynucleotide of the invention may encode a mature protein, a mature 
protein plus a leader sequence (which may be referred to as a preprotein), a precursor of a 
mature protein having one or more prosequences that are not the leader sequences of a 
10 preprotein, or a preproprotein, which is a precursor to a proprotein. having a leader sequence 
and one or more prosequences. which generally are removed dunng processing steps that 
produce active and mature forms of the polypeptide. 
Vectors, host cells, expression 

The invention also relates to vectors that compnse a polynucleotide or 
1 5 polynucleotides of the invention, host cells that are genetically engineered with vectors of the 
invention and the production of polypeptides of the invention by recombinant techniques. 
Cell-free translation systems can also be employed to produce such proteins using RNAs 
derived from the DNA constructs of the invention. 

For recombinant production, host cells can be genetically engineered to incorporate 
20 expression systems or portions thereof or polynucleotides of the invention. Introduction of a 
polynucleotide into the host cell can be effected by methods described in many standard 
laboratoo' manuals, such as Davis et al. BASIC METHODS IN MOLECULAR BIOLOGY, 
(1986) and Sambrook et al.. MOLECULAR CLONING: A LABORATORY MANUAL. 2nd Ed., 
Cold Spring Harbor Laboratory Press, Cold Spnng Harbor, N.Y. (1989), such as, calcium 
25 phosphate transfection, DEAE-dextran mediated transfection, transvection, microinjection, 
cationic lipid-mediated transfection. electroporation. transduction, scrape loading, ballistic 

introduction and infection. 

Repre.sentative examples of appropriate hosts include bacterial cells, such as 
streptococci, staphylococci, cnterococci E. colL streptomyces and Bacillus mbiilis cells; 
30 fungal cells, such as yeast cells and Aspergillus cells: insect cells such as Drosophila S2 and 
Spodoptera Sf9 cells; animal cells such as CHO, COS, HeLa. C127. 3T3, BHK. 293 and 
Bowes melanoma cells; and plant cells. 

A great variety of expression systems can be used to produce the polypeptides of the 
invention. Such vectors mclude, among others, chromosomal, episomal and virus-derived 
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vectors, c.,., vectors denved from bactenal plasmids. from baclenophage. from transposons. 
from veast episomes. from .nsert.on elements, from yeast chromosomal elements, from v.^ses 
such as baculovruses, papova viruses, such as SV40. vaccmia viruses, adenoviruses, fowl pox 
viruses, pseudorab.es v.nises and retroviruses, and vectors denved from combmations thereof, 
^ such as those derived from plasmtd and bacteriophage genetic elements, such as cosmids and 
phagemids. The expression system constructs may contain control regions that regulate as 
well as engender expression. Generally, any system or vector suitable to maintain, propagate 
or express polvnucleotides and/or to express a polypeptide in a host may be used for 
expression in this regard. The appropnate DNA sequence may be inserted mto the expression 
10 system by any of a variety of well-known and routine techniques, such as, for example, those 
set forth in Sambrook et uL MOLECULAR CLONING. A LABORATORY MANUAL, isupra). 

For secretion of the translated protein into the lumen of the endoplasmic reticulum, 
into the periplasmic space or into the extracellular environment, appropnate secretion signals 
may be incorporated into the expressed polypeptide. These signals may be endogenous to the 
1 5 polypeptide or they may be heterologous signals . 

Polypeptides of the invemion can be recovered and punf.ed from recombinant cell 
culmres by well-known methods including ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, phosphocellulose chromatography, 
hydrophobic interaction chromatography, affinity chromatography, hydroxylapat.te 
20 chromatography, and lectin chromatography. Most preferably, high performance liquid 
chromatography is employed for punf.cation. Well known techniques for refolding protem 
may be employed to regenerate active confonnation when the polypeptide is denatured dunng 
isolation and or purification. 
Diagnostic Assays 

This invention is also related to the use of the aroA polynucleotides of the invention 
for use as diagnostic reagents. Detection of aroA -n a eukaryote. particularly a mammal, and 
especially a human, will provide a diagnostic method for diagnosis of a disease. Eukaryotes 
(herein also ' ind.v.dual(s) '). particularly manuals, and especially humans, infected with an 
organism comprising the aroA gene may be detected at the nucle.c acid level by a vanety of 
30 techniques. 

Nucleic acids for diagnosis may be obtained from an infected individuals cells and 
tissues, .such as bone, blood, muscle, cartilage, and skin. Genomic DNA may be used directly 
for detection or may be amplified enzymat.cally by using PGR or other amplification 
technique prior to analysis. RNA or cDNA may also be used in the same ways, bsing 



25 



PCT/USOO/12251 
WO 00/68243 

an^pUfication. charactenzat.on of the spaces and stra.n of proRaryote present .n an .nd.v.dual, 
.ay be made by an analyses of the genotype of the proUaoote gene. Delet.ons and .nsemons 
can be detected by a change .n s.^ of the ampUHed product .n con^panson to the genotype of 
a reference sequence. Pent ntutations can be .dent,f.ed by hybnd.z.ng an^phf.ed DNA to 
5 labeled aroA polynucleotide sequences. Perfectly r^atched sequences can be dist.ngutshed 
from mtsmatched duplexes by RNase d.gest.on or by d.fferences ,n melt.ng temperatures. 
DNA sequence differences may also be detected by alterat.ons tn the electrophoret.c mob.hty 
of the DNA fragments tn gels. wUh or wuhout denatunng agents, or by d.rect DNA 
sequencne. See, e.g., Myers et al.. Sconce, 230: 1242 (1985). Sequence changes at spec.fc 
10 locattons also may be revealed by nuclease protect.on assays, such as RNase and SI 
protecfon or a chem.cal cleavage method. See, e.,.. Cotton et al., Proc. Natl. Acad. Sc.. 
USA, 85: 4397-4401 
(1985). 

Cells carrytn. mutations or polymorph.sms m the gene of the mvent.on may also be 
15 detected at the DNA level by a variety of techniques, to allow for serotypmg, for example. 
For example, RT-PCR can be used to detect mutations. It is particularly preferred to used RT- 
PCR m conjunction with automated detect.on systems, such as, for example. GeneScan. RNA 
or cDNA may also be used for the same purpose. PGR or RT-PCR. As an example, PGR 
primers complementary to a nucleic ac.d encoding aroA can be used to identify and analyze 



20 mutations 



lib. , 
The invention further provides these primers with 1, 2, 3 or 4 nucleotides removed 
from the 5' and/or the 3' end. These pnmers may be used for, among other things, amphfymg 
aroA DNA isolated from a sample denved from an individual. Ti.e primers may be used to 
amplify the gene isolated from an infected individual .such that the gene may then be subject to 
25 various techniques for elucidation of the DNA sequence. In this way, mutations in the DNA 
sequence may be detected and used to diagnose infection and to serotype and/or classify the 
infectious agent. 

-me invention ftirther provides a process for diagnosing, disease, preferably bacterial 
infections, more preferably infections by Streptococcus pneu,no,uae. and most preferably 
30 otitis media, conjunctivitis, pneumonia, bacteremia, meningitis, sinusitis, pleura, empyema 
and endocarditis, and most pan.cularly memngU.s, such as for example infection of 
cerebrospinal Hu.d. comprising determining from a sample derived from an individual a 
increased level of expression of polynucleotide having the sequence of Table 1 [SEQ ID 
NO: 1]. Increased or decreased expression of aroA polynucleotide can be measured using 
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any on of the n,ethods well known ,n the art for the quantat.on of polynucleot.des. such as. 
for example, amplificat.on, PGR. RT-PCR. RNase protection. Northern blott.ng and other 

hybridization methods. 

m addition, a d.agnostic assay tn accordance wuh the mvent.on for detectmg over- 
5 express.on of aroA prote.n compared to normal control t.ssue samples may be used to detect 
the presence of an infect.on, for example. Assay techn.ques that can be used to determ.ne 
levels of a aroA protein, tn a sample denved from a host are well-known to those of sk.U m the 
art. Such assay methods tnclude radioimmunoassays, competu.ve-b.ndtng assays. Western 
Blot analysis and ELISA assays. 

10 Antibodies 

The polypeptides of the invention or vanants thereof, or cells expressmg them can be 
used as an .mmunogen to produce antibod.es immanospecific for such polypeptides. 
"Antibodies" as used herein includes monoclonal and polyclonal antibodies, chimeric, single 
Cham, simianized antibodies and humanized antibodies, as well as Fab fragments, including 
1 5 the products of an Fab immunolglobulin expression library . 

Antibodies generated against the polypeptides of the invention can be obtained by 
administering the polypeptides or epitope-beanng fragments, analogues or cells to an ammal. 
preferably a nonhuman, using routine protocols. For preparation of monoclonal antibodies, 
any techmque known in the art that provides antibodies produced by continuous cell line 
20 cultures can be used. Examples include vanous techniques, such as those m Kohler. G. and 
Milstein C Nature 256: 495-497 (1975); Kozbor er aL Immunology Today 4: 72 (1983). 
Cole et al.. pg. 77-96 in MONOCLONAL ANTIBODIES AND CANCER THERAPY, Alan R. 
Liss. Inc. (1985). 

Techniques for the production of single chain antibodies (U.S. Patent No. 4,946.778) 
05 can be adapted to produce single chain antibodies to polypeptides of this invention. Also, 
transgenic mice, or other organisms such as other mammals, may be used to express 

humanized antibodies. 

Alternatively phaee display technology may be utilized to select antibody genes 
with binding activities towards the polypeptide either from repertoires of PCR amplified v- 
30 eenes of lymphocytes from humans .screened for possessing anti-aroA or from naive 
Ubraries (McCafferty. J. et al.. (1990). Nature 348. 552-554: Marks. J. et al., (1992) 
Biotechnology 10. 779-783). The affinity of these antibodies can also be improved by 
chain shuffling (Clackson, T. et al., ( 1991 ) Nature 352. 624-628). 
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If two antigen b.nding domains are present each doir^am may be directed agamst a 
different epitope - termed bispecific" antibodies. 

The above-described antibodies may be employed to isolate or to identify clones 
expressing the polypeptides to purify the polypeptides by affinity chromatography. 
5 Thus, among others, antibodies against aroA- polypeptide may be employed to treat 

infections, particularly bacterial infections and especially otitis media, conjunctivitis, 
pneumonia, bacteremia, meningitis, sinusitis, pleural empyema and endocarditis, and most 
particularly meningitis, such as for example infection of cerebrospinal fluid. 

Polypeptide variants include ant.gen.cally, epitopically or immunologically 
equivalent variants that form a particular aspect of this invention. The term ■ ant.genically 
equivalent derivative" as used herein encompasses a polypeptide or ,ts equivalent which 
will be specifically recognized by certain antibodies which, when raised to the protein or 
polypeptide according to the invention, interfere with the immediate physical interaction 
between pathogen and mammalian host. The term -immunologically equivalent derivative 
as used herein encompasses a peptide or its equivalent which when used in a suitable 
formulation to raise antibodies in a vertebrate, the antibodies act to interfere with the 
immediate physical interaction between pathogen and mammalian host. 

The polypeptide, such as an antigenically or immunologically equivalent derivative 
or a fusion protein thereof is used as an antigen to immunize a mouse or other animal such 
as a rat or chicken. The fusion protein may provide stability to the polypeptide. The 
amiaen mav be associated, for example by conjugation, with an immunogenic carrier 
protein for example bovine serum albumin (BSA> or keyhole limpet haemocyan.n (KLH). 
Altemativelv a multiple antigenic peptide comprising multiple copies of the protein or 
polvpept.de. or an antigenically or immunologically equivalent polypeptide thereof may be 
sufficiently antigenic to improve immunogenic.ty so as to obviate the use of a carrier. 

Preferablv. the antibody or vanant thereof is modified to make it less immunogenic 
in the individual. For example, if the individual is human the antibody may most preferably 
be "humanized"; where the compl.mentanty determining region(s) of the hybridoma- 
derived antibody has been transplanted into a human monoclonal antibody , for example as 
described ,n Jones. P. et al. (1986), Nature 321, 522-525 or Tempest et al., (1991) 

Biotechnology 9, 266-273. 

The use of a polynucleotide of the invention in genetic immunization will 
preferablv emplov a suitable delivery method such as direct injection of plasmid DNA into 
muscles (Wolff et al., Hum Mol Genet 1992. 1:363, Manthorpe et al.. Hum. Gene Ther. 
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^ ,„ » b,n«i,„g ™l«u.e or convened ,0 produc, can be <.e,=™„ed acc„.,e,> .0 
ihe effectiveness of ihepolemial antagonist. , , .h, 

„ is prefet^l that the agonists and antagontsts of d» ,nve„t,on ac, b, ai.et.ng he 
„versib,e transfotntation of pb„spbo,e„„„pym.ate ,PEP, and sb..,n.,e 3-pi,„,pha.e ,S,P, 
5 to EPSP and inorganic phosphate (Pi). 

Potential antagon.sts inCude stnaU organtc ntoiecules. peptides, poiypcpt.des and 
antibodtes that b,nd to a po,yn„c,eotide ot polypeptide of tbe ,nve„t,on and thereby ,nh,b,t or 
extingftsb tts acttvity. P«entia, »tagonis,s also ma, be s™„ otganic ntoiecnies. a pepttde, 
poiypepttde such as a ciosely teiated ptotein or anttbody that b.nds the sanne sites on a bntd.ng 
,0 ir,e, such as a btnding ..oleeuie. withont tnducng aro^-tnduced acttvtt.es. thereby 
preventtng the aeon of aroA by excluding aroA frombtndtng ^^^^ 

Small organtc molecules of the tnvenl.on preferably ha.c a molecu 
2.000 daltons. more preferably between 3<X, and 1,0C« dal.ons. and most P-'-">--- 
400 and 700 dal.ons. An embodiment provtdes compounds that were no, pubbshed pnor 
,5 the filtn. date of thts application. Another embod.ment provides compounds e.cludtng 
g,yphos:,e <«-phosphonome.hy,glyc,ne, GLP, and^or transtt.on state tnh.bttors or transttton 
State mimelics. 

Potenttal antagonists include a small molecule that binds to and occnptes the b.ndtng 
site of the polypepcide theteby pieventing btndtng to cellular b.nd.ng molecn.es. such that 
,0 normal biological activity ,s prevented. Examples of small molecule, tnclude but are no 
' Itmited .0 small organic n»lecules. peptides or pe,,de-,i.e molecn.es. 0,h„ po .t . 
^agontsts include antisense molecules ,see «a„o, . ^--^2^" Z 
OUGODEOXWVCLEOVDES AS A^ISENSS ,NHWrTORS ur GENE EXPRESSION 
CRC Pn=ss. Boca Raton, FL <,988,. fo, a descnpnon of these molecules,. Preferred potentta, 
25 antagonists tnclude compounds related to and vanants of aroA. 

Each of .he DNA sequences provtded heretn may be used ,n the dtseovery and 
development of an.ibacenal compounds. The encoded protcn. upon expression, can t« 
used a a .arge. fo, .he screening of am,bac,er,a, dnt.s. Additionally, .he DNA sequences 
encodmg the amtno tenrntna, regtons of .he encoded pro.e.n or Sh,„e-D.lgamo or o^r 
30 translation faeilitattng sequences of the resp=c.,ve mRNA can be used .0 construe, 
antisense sequences ,„ control the express.on of the codtng sequence of ttr.erest. 

The inventton also provtdes the use of the polypept.dc. poWnucleottde or tnhtbt.o 
o, .he inven.ion .0 ,n.erfere with the ,nt.,a> phystca, .n.eracon be.w.en a pa.hogen an 
mammaltan host responsible for sequelae of mfecon. In pa„,cular .he molecules of ,he 
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invention may be used: in the prevention of adhesion of bacteria, in particular gram positive 
bacteria, to mammalian extracellular matrix proteins on in-dwell.ng devices or to 
extracellular matrix proteins in wounds; to block aroA prote.n-med.aied mammalian cell 
invasion by, for example, mitiating phosphorylation of mammalian tyrosine kinases 
5 (Rosenshine et at.. Infect. Immun. 60:2211 (1992): to block bacterial adhesion between 
mammalian extracellular matnx proteins and bacterial aroA proteins that mediate tissue 
damage and: to block the normal progression of pathogenesis in infections initialed other 
than by the implantation of in-dwelling devices or by other surgical techniques. 

The antagonists and agonists of the invention may be employed, for instance, to 
10 inhibit and treat otitis media, conjunctivitis, pneumonia, bacteremia, meningitis, sinusitis, 
pleural empyema and endocarditis, and most particularly meningitis, such as for example 
infection of cerebrospinal fluid. 
Vaccines 

Another aspect of the mvention relates to a method for inducing an immunological 
15 response in an individual, particulariy a mammal which comprises inoculating the 
individual with aroA, or a fragment or variant thereof, adequate to produce antibody and/ or 
T cell immune response to protect said individual from infection, particularly bacterial 
infection and most particularly Streptococcus pneumoniae infection. Also provided are 
methods whereby .such immunological response slows bacterial replication. Yet another 
20 aspect of the invention relates to a method of inducing immunological response in an 
individual which comprises delivering to such individual a nucleic acid vector to direct 
expression of aroA. or a fragment or a variant thereof, for expressing aroA. or a fragment or 
a variant thereof in vivo in order to induce an immunological response, such as, to produce 
antibody and/ or T cell immune response, including, for example, cytokine-producing T 
25 cells or cytotoxic T cells, to protect said individual from disease, whether that di.sease ,s 
already established within the individual or not. One way of administenng the gene is by 
accelerating it into the desired cells as a coating on particles or otherwise. 

Such nucleic acid vector may comprise DNA, RNA, a modified nucleic acid, or a 
DNA/RNA hybrid. 

30 A further aspect of the invention relates to an immunological composition which, 

when introduced mto an individual capable or having induced within it an immunological 
response, induces an immunological response in such individual to a aroA or protein coded 
therefrom, wherein the composition compri.ses a recombinant aroA or protein coded 
therefrom comprising DNA which codes for and expresses an antigen of said aroA or 
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p,„,e,n coded ,he,efn>m. The ™™„o,og,ca> response ™y .e .sed ,h„,peu,icaUy o, 
p„phy.acUcany and ».y ,ak= .he f.™ of anubody ,„,™nl.y o, eeUular ,n,p,un,w .uch a. 

lhal arising from CTL or CD4+ T cells. 

A aroA polypep..de or a fragment thereof may be fused wuh c„-pro,e,n whtch may 
no, b, itself produce au.tbod.es. but is capable of stabilizing the first protein and pr<«luc,ng 
a fused proretn which will have tmmunogen.c and protective propertt.s Thus fused 
recombtnant protein, preferably further comprises an anttgen.c co-protetn. such as 
irpoprotein D from He.opmu. i,m.n:... Glutath,o„e-S-tra„sfer.se ,GST, or beta- 
galactostdase, relatively large co-pro,eins wh.ch so.ubtltze the protetn and facthta e 
productton and puriftcatton thereof. Moreover, the co-protetn may act as an adjuvant m .he 
sense of providing a generalized st.mu.at.on of the tntmune system The co-pro,ein may be 
attached to e.ther the ammo or carboxy terminus of the firs, pro.em 

Provided by .his invent.on are compostttons, pan.cularly vaccine compostttons. and 
methods compristng the polypeptides or po.ynucteottdes of the invention and 
im^unostimulatory DNA sequences, such as those descnbed ,n Sato, Y, al. Science .73: 

352(1996). , 

Also provided by thts tnventton are me.hods ustng the described polynucleotide or 
particular fraemems thereof which have been shown to encode non-vartable regions of 
bacterial cell surface prote.ns in DNA constntcts used in such genetic immumza.ion 
experiments it. animal models of infection with S.r.^oc^c. p„«n.on^c will be 
pai^icularly useful for identifying protein epitopes able to provoke a prophylactic or 
therapeutic immune response, 1. ts believed that this approach will allow for .he subsequent 
preparation of monoclonal antibodies of panicular value from .he requisite organ .>f .he 
animal successfully rcststing or clearing infection for the development of prophylactic 
agents or therapeutic treatments of bacterial infection, panicularly S,re„»coca,s 
p„e„™»,„»e infection, in mammals, particularly humans. 

The polypeptide may b. used as an antigen for vaccination of a host to produce 
specific anttbodies which protect agatnst tnvasion of bacteria, for example by bloclcing 
adherence of bacteria to damaged tis.sue. Examples of ti.ssue damage tnclude wounds ,n 
s.i„ or connective tissue caused, e g,, by mechanical, chemical or thermal damage or by 
implantation of indwelling devices, or wounds ,n the mucous membranes, such as the 
mouth, maiTunary glands, urethra or vagina. 

The invention also tncludes a vaccine formulation which comprises an 
immunogenic recombinant protein of the invention together with a suiutble carrier. Since 
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the protein may be broken down m the stomach, it .s preferably administered parenterally. 
including, for example, administration that .s subcutaneous, intramuscular, intravenous, or 
intradermal. Formulations suitable for parenteral administration include aqueous and non- 
aqueous sterile injection solutions which may contain anti-oxidants, buffers, bacienostais 
5 and solutes which render the formulation insotonic wuh the bodily fluid, preferably the 
blood, of the individual: and aqueous and non-aqueous sterile suspensions wh.ch may 
include suspending agents or thickening agents. The formulations may be presented ,n un.t- 
do.se or multi-dose containers, for example, sealed ampules and v.als and may be stored in a 
freeze-dried condition requiring only the addition of the sterile liquid earner immediately 
10 prior to use. The vaccine formulation may also include adjuvant systems for enhancing the 
immunogenicity of the formulation, such as o.l-in water systems and other systems known 
in the art. The dosage will depend on the specific activity of the vaccine and can be readily 
determined by routine experimentation. 

While the invention has been described with reference to certain aroA protein, it is 
15 to be understood that this covers fragments of the naturally occurring protein and similar 
proteins with additions, deletions or substitutions wh.ch do not substantially affect the 
immunogenic properties of the recombinant protein. 
Compositions, kits and administration 

The invention also relates to compositions comprising the polynucleotide or the 
20 polypeptides discus.sed above or their agonists or antagonists. The polypeptides of the 
invention may be employed in combination with a non-sterile or sterile carrier or carriers for 
use with cells, tissues or organisms, such as a pharmaceutical earner suitable for 
administration to a subject. Such compositions compnse, for instance, a media additive or a 
therapeutically effective amount of a polypeptide of the invention and a pharmaceutically 
25 acceptable earner or exc.p.ent. Such carriers may include, but are not limited to, saline, 
buffered saline, dextrose, water, glycerol, ethanol and combinations thereof. The fomiulation 
should suit the mode of administration. The invention further relates to diagnostic and 
pharmaceutical packs and kits compnsing one or more containers filled with one or more of 
the ingredients of the aforementioned compositions of the invention. 
30 ^ Polypeptides and other compounds of the invention may be employed alone or in 

conjunction with other compounds, such as therapeutic compounds. 

The phannaceulical compositions may be administered m any effective, convenient 
manner including, for instance, administration by topical, oral, anal, vaginal, intravenous, 
intraperitoneal, intramuscular, subcutaneous, intranasal or intradennal routes among others. 
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m therapy or as a prophylacfc. the act.vc agent may be adi^n.s.ered to an 
individual as an injectable compos.t.on, for example as a sier.le aqueous dispersion. 

preferably isotonic. 

Alternatively the composition may be formulated for topical application 
5 for example in the form of ointments, creams, lotions, eye ointments, eye drops, ear drops, 
mouthwash, impregnated dressings and sutures and aerosols, and may coniam appropriate 
conventional additives, including, for example, preservatives, solvents to ass.st drug 
penetration, and emollients in ointments and creams. Such topical formulations may also 
contain compatible conventional carriers, for example cream or ointment bases, and ethanol 
10 or oleyl alcohol for lotions. Such carriers may constitute from about 1 % to about 98% by 
weight of the formulation; more usually they will constitute up to about 80% by weight of 
the formulation. 

For administration to mammals, and particularly humans, it is expected that the 
daily dosage level of the active agent will be from 0.01 mg/kg to 10 mg/kg. typically around 
15 1 mg/kg. The physician in any event will determine the actual dosage which will be most 
suitable%or an individual and will vary with the age, weight and response of the particular 
individual. The above dosages are exemplary of the average case. There can. of course, be 
individual instances where higher or lower dosage ranges are merited, and such are within 

the scope of this invention. 
20 In-dwelling devices include surgical implants, prosthetic devices and catheters, i.e., 

devices that are introduced to the body of an individual and remain in position for an 
extended time. Such devices include, for example, artificial joints, heart valves, 
pacemakers, vascular grafts, vascular catheters, cerebrospinal fluid .shunts, urinary 
catheters, continuous ambulatory peritoneal dialysis (CAPD) catheters. 
25 The composition of the invention may be administered by injection to achieve a 

systemic effect against relevant bacteria shortly before insertion of an in-dwelling device. 
Treatment may be continued after surgery during the in-body time of the device. In 
addition, the composition could also be used to broaden perioperative cover for any surgical 
technique to prevent bacterial wound infections, especially Streptococcus pneumoniae 

30 wound infections. 

Many orthopaedic surgeons consider that humans with prosthetic joints should be 
considered for antibiotic prophylaxis before dental treatment that could produce a 
bacteremia. Late deep infection is a serious complication sometimes leading to loss of the 
prosthetic joint and is accompanied by significant morbidity and mortality. It may therefore 
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be possible to extend the use of the active agent as a replacement for prophylactic 

antibiotics in this situation. 

In addition to the therapy described above, the compositions of this invention may 
be used generally as a v.ound treatment agent to prevent adhesion of bacteria to matrix 
5 proteins exposed in wound tissue and for prophylactic use in dental treatment as an 
alternative to, or in conjunction with, antibiotic prophylaxis. 

Alternatively, the composition of the invention may be used to bathe an indwelling device 
immediately before insertion. The active agent will preferably be present at a concentration 
of l^g/ml to lOmg/ml for bathing of wounds or indwelling devices. 
10 A vaccine composition is conveniently in injectable form. Conventional adjuvants 

may be employed to enhance the immune response, A suitable unit dose for vaccination is 
0.5-5 microgram/kg of antigen, and such dose is preferably administered 1-3 times and 
with an interval of 1-3 weeks. With the indicated dose range, no adverse toxicological 
effects will be observed with the compounds of the invention which would preclude their 
15 administration to suitable individuals. 

Each reference disclosed herein is incorporated by reference herein in its entirety. 
Any patent application to which this application claims priority is also incorporated by 
reference herein in its entirety. 

20 EXAMPLES 

The examples below are earned out using standard techniques, which are well known 
and routine to those of .skill m the art, except where otherwise described in detail. The 
examples are illustrative, but do not limit the invention. 
25 Example 1 Strain selection, Library Production and Sequencing 

The polynucleotide having the DNA sequence given in SEQ ID NO:l was obtained 
from a library of clones of chromosomal DNA of Streptococcus pneumoniae in £. coU. The 
sequencing data from two or more clones containing overlapping Streptococcus pneumoniae 
DNAs was used to construct the contiguous DNA sequence ,n SEQ ID NO:l. Libraries 
30 may be prepared by routine methods, for example: 
Methods 1 and 2 below. 

Total cellular DNA is isolated from Streptococcus pneumoniae 0100993 according 
to .standard procedures and size-fractionated by either of two methods. 
Method 1 
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The library is amplified by standard procedures. 

...s ..op„.e .0 ...e , ~ ^^^^^^^^ 

accord.ng .o M.ndard procedures. EcoRI Imker. 

,he„ „««d imo .he vec.or Lambda ZapU ,..a, have bee, cu, .„h F.coRl, 
tc.„;d b, sundard procedure, and .... ,n.c.d .e pacU.ed l.brar. The „brar> 
is amplified by standard procedures. 
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„Z an.* C.,™c«H..I.»= S.ead,-S.a.e Kine.ic Stud.e. of =- 
E„o>pyruv,.sh-.Uin,a...3.phosph,.e synthase from 

,.HLp,r„v„sh«e.3.phospha.e ,EPSP, .,n,ha» ,BC .5 , . .he 
e„.y.e ,n .he aroma,, amino acid b.osyn.hes. pa,h»ay «he .h.k.n.a.e pa. .a, <• 

;r„.n. ca.,>.e. au uhu- rever.b,e ,ra„s.or.a„o„ o, P-^--;-™;- ^ 

, h„,r'!1PlloEPSP and inoffanicphospha.e (P.) (Schemer). 
,PEP, and sh,k,ma.e 3-phospha.e (S3P) «> ,„,d.»pec.™m, 
,„,e„se in.e„s. in .his enzyme has been provo. d by .he s .y 
pos..emergence he,b,cide glyphosa^ (N-phosphono.ne,hylgiyc,ne, GLP, 
enzvme ,2, by r„nc.ion,„g as a „ansi.,on.s.a.e inh,b,.o, ,3, (Scheme 2 

• Provided here,„ is .he iden.if.ca.io. of EPSP syn.h.se ,n ^"-""»"'«- 

Its kineuc nice r K r-i P -md resoective enzyme reaction 

reacions. The inhibi.ion pa..en,s of .he enzyme by GLP and ..spec. 

pr^ducaredescnbed. „vers,blc .he enzvme can be assayed in 

Since .he EPSP syn.hasereac..on.s freely revers.hic.. . 

^„ forwa. and reverse direcons. in .he forward rcacon. — "—^^ , 
™„„o.ed by nuoresccnce change when conpied ,c ,he "'^--^'^^^ 
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and lacta. dehydrogenase and the change .n absor.ance . 340 nn. (NADH, was defected 

,n the fonvard reaction. GLP is a cotnpetU.ve .nhibUor wUh respect to PEP. but an 
uncompetitive inhibitor relative to S3P (Figure 1). 
5 Product tnh,bition ,n the forward react.on by EPSP shows that EPSP .s a 

cotnpet,t,ve inhibitor versus both PEP and S3P (Figure 2). 

m the reverse react.on, GLP led to uncompet.t.ve inhibition versus EPSP. but 
noncompetitive inhibition versus Pi (Figure 3). 

Product inhibition m the reverse reaction by S3P shows that S3P competitively 
10 inhibits the enzyme versus EPSP. but appears to be an uncompetitive inhibitor versus Pi 

(Fieure 4). , ^ 

The panems of inhibiuon of S r^.^o^^a, EPSP ,,n,ha« and .h. co-respond,,,, 

kinetic constants are summarized in Table I. , ^ . 

Based on these results, we propose a random sequential mechanism for the forward 
15 reaction of 5. pneumoniae EPSP synthase (Scheme 3); but the cu.ent data are not 
sufficient to pinpoint the mechanism in the reverse reaction. 

The herbicide glyphosate is an inhibitor of S. pneumoniae EPSP synthase. 
The inhibition of EPSP synthase by glyphosate in both forward and reverse reactions has 
been characterized by steady-state kinetics. 

Product inhibition of the enzyme m both directions has also been characterized. 
The results support a random sequential mechanism in the forward reaction, but deducing 
the mechanism in the reverse react.on requires further investigation. 

A number of compounds were screened against EPSP synthase ,n .uro and 
shown to comparably inhibit the enzyme. MlCs against S.pneumomae and E.coU also 
.5 demostarted that these compounds were active antibacterial compounds. 
' Analysis of these compounds indicates they are active site directed and one compound has 
been shown to be competitive with the shikimate-3-phosphate substrate. 
Tabie 1. inhibition patterns and Kinetic constants of S. pneumoniae EPSP synthase. 
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Competitive 



2,8+0.3 
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Reverse 



GLP vs. S3P 
EPSP vs. PEP 


Uncompeiitive 
Competitive 


65.6±2.7 
329±26 


EPSP vs. S3P 


Competitive 


44.()±2.4 


GLPvs EPSP 


Uncompetitive 


625±23 


GLP vs. ri 


Noncompetitive 


n36±60 


S3P vs. EPSP 


Competitive 


12.8±2.1 


S3P vs. Pi 


Uncompetitive 


4936+158 
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C.: 0, .e ..u ,n.o .n GoPU ,..06, B„*ac„s So.w^e U... T.e 

.n, .he best f.l of the data to the respective models, 
inhibition patterns represent the best lit oi 

References for Example 2 

, W . WaM,s. N. G.. MazzuUa, M. ^, a.a,l.e,, A. F.. Zhang, L., Uu, W. S„ 

Kallento,H„ and Payne. D, J, (1599) m»„.«np, 
, W*..M.R.<.992, P™c.«.WSd.«SA89.488«887 
; Boc«ocR.M.R.andCos.ins,..R.<l983, F£BS t.„. 154. >-7-.-« 
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